MICROBIOLOGICAL PRODUCTION METHOD FOR α-L-ASPARTYL-L-PHENYLALANINE



FIELD OF THE INVENTION



The present invention relates to a new,
microbiological, method for the production of α-L-
aspartyl-L-phenylalanine (Asp-Phe) from the substrates
L-aspartic acid (L-Asp) and L-phenylalanine (L-Phe).
The present invention also relates to novel DNA
fragments or combinations of DNA fragments encoding a new
Asp-Phe dipeptide synthetase, micro-organisms containing
such DNA fragments, as well as to the new Asp-Phe
dipeptide synthetases themselves.



BACKGROUND OF THE INVENTION

α-L-Aspartyl-L-phenylalanine (hereinafter
also referred to as Asp-Phe) is an important dipeptide,
inter alia used for the production of α-L-aspartyl-L-
phenylalanine methyl ester (hereinafter also referred
to as APM). APM is known to be a high intensity
artificial sweetener, having a sweetness which is about
200x as potent as the sweetness of sucrose. The β-form
of APM, as well as the stereoisomers of APM wherein one
or both of the amino acids are in the D-configuration,
do not have sweet properties. APM is used for the
sweetening of various edible materials.

Various production methods of APM exist;
present routes may be divided into chemical and
biochemical/microbiological (in particular, enzymatic)
routes. In the ways of producing APM by using known
peptide synthesis techniques tedious and expensive
processes have to be performed in order to achieve
selective α-L,L-coupling, involving intensive protecting
and deprotecting of α-amino, carboxyl and side chain
groups. Fermentative routes, on the other hand, in
general are cheap and intrinsically they display
enantio- and regioselectivity. Therefore, fermentative
routes have been considered to be promising alternatives
for the above-mentioned chemical and biochemical
synthesis routes. As can be seen from EP-A-0036258, it
has so far been deemed unsuited to produce the dipeptide
Asp-Phe in a micro-organism as part of the micro-
organism's own protein producing processes;
theoretically such production might be achieved by
inserting in the DNA of a micro-organism the nucleotide
base sequences GAC or GAT (being known to be a codon for
L-Asp) and TTT or TTC (being known to be a codon for L-
Phe), preceded and followed by appropriate processing or
termination codons in the correct reading frame, and
under appropriate control. It therefore has been
attempted in EP-A-0036258 to achieve the synthesis of
Asp-Phe indirectly through prior production of protein
segments of the formula (Asp-Phe)n, where n is a large
number; this has been done by inserting into a cloning
vehicle a synthesised DNA fragment coding for such poly-
(Asp-Phe) protein. However, such a ribosomal fermentative
route is still tedious and economically unattractive.
Major drawbacks lie in the recovery of the Asp-Phe
dipeptide from the polypeptide. Similar drawbacks can be
attributed to a method, as described by Choi, S.-Y. et
al. in J. Microbiol. Biotechnol., 2, 1992, p. 1-5,
wherein a polypeptide comprising segments of the
tripeptide sequence Asp-Phe-Lys is synthesised.
Therefore a need still exists for finding a direct
fermentative route to Asp-Phe. Direct fermentation of
Asp-Phe is hitherto unknown.
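
As a small illustration of the ribosomal route discussed above, the codons named in the text (GAC or GAT for L-Asp, TTT or TTC for L-Phe) can be combined into the coding sequences for one Asp-Phe unit or for a poly-(Asp-Phe) repeat. The following Python sketch is illustrative only: the function names are hypothetical, and start, processing and termination codons are deliberately left out.

```python
from itertools import product

# Codons named in the text (coding-strand DNA, read 5' to 3').
ASP_CODONS = ("GAC", "GAT")
PHE_CODONS = ("TTT", "TTC")


def asp_phe_unit_sequences():
    """Return every six-nucleotide sequence that encodes one Asp-Phe unit."""
    return [asp + phe for asp, phe in product(ASP_CODONS, PHE_CODONS)]


def poly_asp_phe_sequence(n, asp="GAC", phe="TTC"):
    """Return a coding sequence for (Asp-Phe)n using one fixed codon pair."""
    if asp not in ASP_CODONS or phe not in PHE_CODONS:
        raise ValueError("codon does not encode the expected amino acid")
    if n < 1:
        raise ValueError("n must be at least 1")
    return (asp + phe) * n


if __name__ == "__main__":
    print(asp_phe_unit_sequences())
    print(poly_asp_phe_sequence(3))
```

The sketch only shows that encoding the repeat is straightforward; the drawback named in the text, recovering the dipeptide from the expressed polypeptide, is not addressed by it.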

DESCRIPTION OF THE INVENTION

METHOD FOR THE PRODUCTION OF ASP-PHE

Surprisingly, the inventors have now found a new and
promising alternative microbiological method for the
production of α-L-aspartyl-L-phenylalanine (Asp-Phe)
from the substrates L-aspartic acid (L-Asp) and L-
phenylalanine (L-Phe) wherein the substrates are
contacted, in the presence of an effective amount of
adenosine triphosphate (ATP), with a non-ribosomal
dipeptide synthetase comprising two minimal modules
connected by one condensation domain, wherein the N-
terminal module of these modules recognises L-
aspartic acid and the C-terminal module of these
modules recognises L-phenylalanine and is
covalently bound at its N-terminal end to the
condensation domain, and wherein each of these minimal
modules is composed of an adenylation domain and a 4'-
phosphopantetheinyl cofactor containing thiolation
domain, and wherein the α-L-aspartyl-L-phenylalanine (Asp-
Phe) formed is recovered.

This new method thus provides a
microbiological process for direct fermentation of Asp-
Phe, which in a subsequent methylation step may be
converted into the intense sweetener aspartame.
Production of Asp-Phe by direct fermentation is hitherto
unknown, as is non-ribosomal synthesis of this
dipeptide. The inventors thus have provided a direct
microbiological method for producing the dipeptide Asp-
Phe without the need for any protecting and deprotecting
steps.

The novel non-ribosomal dipeptide
synthetases which, according to the present invention,
can be used for the production of Asp-Phe are also
indicated hereinafter as Asp-Phe dipeptide synthetases
or as Asp-Phe synthetases. It is known (for instance,
from P. Zuber et al., in "Bacillus subtilis and other
Gram-positive bacteria", Sonenshein et al. (Eds.), Am.
Soc. Microbiol., Washington, DC, 1993, p. 897-916) that
micro-organisms can produce bioactive peptides through
ribosomal and non-ribosomal mechanisms. The bioactive
peptides so far known to be synthesised non-
ribosomally are produced by a number of soil bacteria
and fungi. These bioactive peptides can range from 2 to
48 residues, and are structurally diverse. They may
show a broad spectrum of biological properties
including antimicrobial, antiviral or antitumor
activities, or immunosuppressive or enzyme-inhibiting
activities. As such, these non-ribosomally synthesised
bioactive peptides form a class of peptide secondary
metabolites which has found widespread use in medicine,
agriculture, and biological research. More than
300 different residues have so far been found to be
incorporated into these peptide secondary metabolites.
However, until now not a single non-ribosomally formed
peptide has been identified having (as a part of its
peptide sequence) the dipeptide Asp-Phe in it, nor has
the dipeptide Asp-Phe itself been identified as a non-
ribosomally synthesised product.

According to the present invention Asp-Phe
can now be produced non-ribosomally, and novel non-
ribosomal Asp-Phe synthetases can be used for the
synthesis of Asp-Phe. Hereinafter, in the part of the
specification dealing with the DNA fragments encoding
the novel Asp-Phe synthetases, it will be elucidated in
more detail how these novel Asp-Phe synthetases can be
obtained and have been made available in the context of
the present invention. For better understanding of the
present invention, first, however, some general
background as to non-ribosomal peptide synthesis is
presented.

In non-ribosomal synthesis of peptides
known so far generally a multiple carrier thiotemplate
mechanism is involved (T. Stein et al., J. Biol. Chem.
271, 1996, p. 15428-15435). According to this model,
peptide bond formation takes place on multi-enzyme
complexes which are named peptide synthetases and which
comprise a sequence of amino acid recognising modules.
On the peptide synthetases a series of enzymatic
reactions takes place which ultimately leads to the
formation of a peptide by sequential building-in of
amino acids, in an order predetermined by the order of
modules recognising the cognate amino acids, into the
peptide. This series of enzymatic reactions includes,
schematically:

1. recognition of the amino acid substrates; 

2. activation of said recognised amino acid
substrates to their aminoacyl-adenylates (that is,
the aminoacyl adenosine-monophosphate, aa-AMP) at
the expense of Mg²⁺-ATP (adenylation);

3. binding of the aminoacyl-adenylates in the form of
their more stable thioesters to the cysteamine
group of the enzyme-bound 4'-phosphopantetheinyl
(4'-PP) cofactors (thiolation). The ATP consumed
in the adenylation reaction is hereby released in
the monophosphate form (AMP);

4. depending on the peptide to be synthesised non-
ribosomally, the thiol-activated substrates may be
modified (e.g. by epimerisation or N-methylation);

5. formation of the peptide product by N to C
stepwise integration of the thioesterified
substrate amino acids (modified, as the case may
be) into the growing peptide;

6. releasing the peptide formed non-ribosomally from
the template.



Assuming this general scheme also to be
correct for the novel non-ribosomal synthesis of Asp-
Phe according to the present invention, this means that
this synthesis involves the successive steps of (i)
recognition of L-Asp and L-Phe, (ii) formation of an L-
Asp- and an L-Phe-acyladenylate, (iii) binding thereof
to the cysteamine group of the 4'-PP cofactor in the
respective thiolation domains, (iv) formation of the
Asp-Phe dipeptide by transfer of the thioester-
activated carboxyl group of L-Asp to the amino group of
L-Phe, while the condensation product remains
covalently attached to the multi-enzyme complex via the
4'-PP cofactor in the thiolation domain of the Phe-
recognising module, and (v) release of the Asp-Phe
formed.
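
The steps (i) to (v) above can be pictured with a toy bookkeeping model. The Python sketch below is not a biochemical simulation; the class and function names are hypothetical, and the model assumes exactly one ATP is converted to AMP per amino acid activated, as the general scheme describes. It only shows that the order of the modules fixes the order of the residues and that the product sits on the last (C-terminal) module until release.

```python
from dataclasses import dataclass


@dataclass
class Module:
    """One amino-acid-recognising module (illustrative model only)."""
    substrate: str      # amino acid recognised by the A-domain
    tethered: str = ""  # species bound to the 4'-PP cofactor of the T-domain


def run_template(modules, available):
    """Walk the template from the N-terminal to the C-terminal module.

    Returns (released_peptide, atp_consumed).
    """
    if not modules:
        raise ValueError("at least one module is required")
    atp_consumed = 0
    # Steps (i)-(iii): recognition, adenylation, thiolation.
    for module in modules:
        if module.substrate not in available:
            raise ValueError(f"substrate {module.substrate} is not available")
        atp_consumed += 1  # one ATP -> AMP per activated amino acid
        module.tethered = module.substrate
    # Step (iv): the upstream acyl group is transferred to the downstream
    # residue, so the growing chain always ends up on the later T-domain.
    for upstream, downstream in zip(modules, modules[1:]):
        downstream.tethered = upstream.tethered + "-" + downstream.tethered
        upstream.tethered = ""
    # Step (v): release from the C-terminal module.
    peptide = modules[-1].tethered
    modules[-1].tethered = ""
    return peptide, atp_consumed


if __name__ == "__main__":
    print(run_template([Module("Asp"), Module("Phe")], {"Asp", "Phe"}))
```

Swapping the two modules in the list yields Phe-Asp instead, which is why the Asp-recognising module has to be the N-terminal one.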

According to the present invention the
substrates L-Asp and L-Phe are contacted with a non-
ribosomal Asp-Phe dipeptide synthetase in the presence
of an effective amount of ATP. An effective amount of
ATP as meant herein is an amount of ATP which ensures
that the dipeptide formation takes place at a suitable
rate. Usually such rate will be at least one turn-over
per minute, i.e. a turn-over number (kcat) of 1 per
minute; preferably kcat is at least 10 per minute. In
order to enable an economically attractive process the
ATP consumed by the peptide synthesis reaction is
preferably regenerated.
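
To give a feel for what these turn-over numbers mean in practice, the following Python sketch converts kcat into an amount of product. It is a back-of-the-envelope calculation under stated assumptions: the synthetase is taken to work at a constant kcat (saturating substrates and ATP, no inactivation), and the function name and units are chosen here for illustration only.

```python
def asp_phe_formed(enzyme_umol, kcat_per_min, minutes):
    """Return micromoles of Asp-Phe formed at a constant turn-over number.

    enzyme_umol   -- amount of active synthetase, in micromoles
    kcat_per_min  -- turn-over number, in catalytic cycles per minute
    minutes       -- reaction time, in minutes
    """
    if min(enzyme_umol, kcat_per_min, minutes) < 0:
        raise ValueError("all arguments must be non-negative")
    return enzyme_umol * kcat_per_min * minutes


if __name__ == "__main__":
    # Minimum and preferred rates from the text, for one hour of reaction.
    print(asp_phe_formed(1.0, 1.0, 60))
    print(asp_phe_formed(1.0, 10.0, 60))
```

Under these assumptions one micromole of synthetase forms 60 micromoles of Asp-Phe per hour at the minimum rate and 600 micromoles per hour at the preferred rate.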

The contacting of the substrates L-Asp and
L-Phe with the non-ribosomal Asp-Phe dipeptide
synthetase may be done in any suitable way; for
instance - if the Asp-Phe dipeptide synthetase is
present in a micro-organism - L-Asp and L-Phe may be
fed into the culture medium containing said micro-
organism. Alternatively micro-organisms may be used
which are capable of overproducing L-Asp and/or L-Phe
(e.g. from glucose), with separate feeding to the
micro-organism of the amino acid (L-Asp or L-Phe) which
is not produced by the micro-organism. All these
methods may be called in vivo methods. ATP may be
regenerated in vivo in the Asp-Phe producing micro-
organism.

The contacting of the substrates L-Asp and 
L-Phe with the non-ribosomal Asp-Phe dipeptide 
synthetase also may be done by using the synthetase in 
its isolated form, that is by an in vitro method. In 
such in vitro methods ATP regeneration is to be taken
care of separately. This may be done by applying an
ATP-regeneration system. ATP-regeneration systems are
readily available to the skilled man.

Protein chemical studies and recent 
progress in cloning and sequencing of genes encoding 
peptide synthetases of bacterial and fungal origin have 
made it clear that the known peptide synthetases have a 
highly conserved and ordered structure composed of so- 
called modules. These modules have been defined as 
semi-autonomous units within peptide synthetases that
carry all information needed for recognition,
activation, and modification of one substrate. Although
the modules in principle can act independently, it is
generally assumed that they have to work in concert, in
a template-based mode of action, to achieve peptide
elongation.

In general, the modules of peptide
synthetases, each module being about 1000-1400 amino
acids in length (i.e., the modules have molecular
weights in the range of 120-160 kDa), are themselves
composed as a linear arrangement of conserved domains
specifically representing the enzyme activities
involved in substrate recognition, activation (and,
optionally, as the case may be, modification) and
condensation (i.e. peptide bond formation). Two of such
distinct domains, the adenylation and thiolation
domains (A-domain and T-domain), together form the
smallest part of a module that retains all catalytic
activities for specific activation and covalent binding
of the amino acid substrate. Stachelhaus et al. have
designated this core fragment of the modules as a
"minimal module" (T. Stachelhaus et al., J. Biol. Chem.
270, 1995, p. 6163-6169).

The term "minimal module" as used here
therefore, according to said definition, refers to such
combined core fragment of the modules, consisting of an
adenylation domain and a thiolation domain.

Some highly conserved core motifs of
adenylation and thiolation domains, as known to exist
in peptide synthetases, are listed in table 1, together
with some highly conserved core motifs of condensation
and thioesterase domains (which will be addressed in
more detail in later parts of this specification).

The so-called "adenylation domain" (A-
domain, about 550 amino acids) is an essential region
of each module. The A-domain has been shown to bear the
substrate-recognition and ATP-binding sites and is
therefore solely responsible for activation of the
recognised amino acid as its acyl adenylate through ATP
hydrolysis (T. Stachelhaus et al., J. Biol. Chem. 270,
1995, p. 6163-6169).



Table 1. Highly conserved core motifs of catalytic
domains of known peptide synthetases

Source: M. Marahiel et al., Chem. Rev.
97, 1997, p. 2651-2673

Note: Former nomenclature is given in brackets.

Domain         Core(s)        Consensus sequence

Adenylation    A1             L(TS)YxEL
               A2 (core 1)    LKAGxAYL(VL)P(LI)D
               A3 (core 2)    LAYxxYTSG(ST)TGxPKG
               A4             FDxS
               A5             NxYGPTE
               A6 (core 3)    GELxIxGxG(VL)ARGYL
               A7 (core 4)    Y(RK)TGDL
               A8 (core 5)    GRxDxQVKIRGxRIELGEIE
               A9             LPxYM(IV)P
               A10            NGK(VL)DR

Thiolation     T (core 6)     DxFFxxLGG(HD)S(LI)

Condensation   C1             SxAQxR(LM)(WY)xL
               C2             RHExLRTxF
               C3 (His)       MHHxISDG(WV)S
               C4             YxD(FY)AVW
               C5             (IV)GxFVNT(QL)(CA)xR
               C6             (HN)QD(YV)PFE
               C7             RDxSRNPL

Thioesterase   TE             G(HY)SxG
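
The consensus notation of table 1 (x for any residue, parentheses for alternative residues at one position) maps directly onto regular expressions. The Python sketch below is a minimal illustration of how such motifs could be searched for in a protein sequence; the function names and the test sequence are hypothetical, only three of the motifs are included, and an exact-match search is an assumption made here for simplicity, since real domains often deviate from the consensus at one or more positions.

```python
import re

# Three consensus motifs from table 1
# (x = any residue, parentheses = alternatives at one position).
CORE_MOTIFS = {
    "A1": "L(TS)YxEL",
    "A4": "FDxS",
    "TE": "G(HY)SxG",
}


def consensus_to_regex(motif):
    """Translate table 1 notation into a regular expression string."""
    pattern = re.sub(r"\(([A-Z]+)\)", r"[\1]", motif)  # (TS) -> [TS]
    return pattern.replace("x", ".")                   # x    -> any residue


def find_core_motifs(sequence, motifs=CORE_MOTIFS):
    """Return {core name: [0-based start positions]} for exact consensus hits."""
    hits = {}
    for name, motif in motifs.items():
        regex = re.compile(consensus_to_regex(motif))
        starts = [match.start() for match in regex.finditer(sequence)]
        if starts:
            hits[name] = starts
    return hits


if __name__ == "__main__":
    # Made-up sequence, used only to exercise the functions.
    print(find_core_motifs("MKLTYAELQQFDGSAAGHSAGKK"))
```

For genuine sequence analysis a profile-based search tolerant of mismatches would normally be preferred over exact pattern matching.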
Very recently the first 3D structure of an
adenylation domain of a peptide synthetase (PheA from
GrsA) has been reported (E. Conti et al., EMBO J. 16,
1997, p. 4174-4183). This structure shows that almost
all highly conserved core motifs are positioned around
the active site where the substrates are bound. The
main residues involved in building the substrate-
binding pocket could also be assigned; they were found
to be located between core motifs A3 and A6 and were
not highly conserved.

The A-domain of a module is very important
in determining the specificity of the module.
"Specificity" of a module means that the module has a
certain preference in recognising one amino acid above
other amino acids or above another amino acid. Of
course, also the concentration of each individual amino
acid present near the module may play a role. If, for
instance, the concentration of a specific amino acid is
much higher than that of (most of) the other amino
acids, the requirements for specificity may be somewhat
less strict.

The so-called "thiolation domain" (T-
domain, about 100 amino acids; also called peptidyl
carrier protein (PCP)) is a domain located directly
downstream of the adenylation domain. It forms an
integral part of peptide synthetases, and is the site
of 4'-PP cofactor binding and substrate acylation.
Within the T-domain, 4'-PP is covalently bound to the
sidechain of an invariant serine residue located within
the highly conserved thiolation core motif (see
table 1). If the T-domain in a module of the peptide
synthetase does not carry its 4'-PP cofactor, no
covalent binding of the aminoacyl substrate can take
place and chain elongation will be impossible.

It has been found that in peptide 
synthetases known so far, every T-domain is converted 
from the inactive apo form to the active holo form by 
transfer of the 4'-PP moiety from Coenzyme A (CoA) to 
the sidechain of the above mentioned serine residue. 
This post-translational priming of each T-domain is 
mediated by peptide synthetase specific members of a 
recently discovered enzyme superfamily, the 4'- 
phosphopantetheinyl transferases. They utilise CoA as a 
common substrate, and appear to attain specificity 
through protein/protein interactions. 

The Asp-Phe dipeptide synthetase as used in
the method of the present invention comprises two
minimal modules, respectively one minimal module at its
N-terminal side recognising L-Asp and another minimal
module at its C-terminal side recognising L-Phe. The
term "minimal module" is used in the same meaning as
given thereto by Stachelhaus et al., J. Biol. Chem.
270, 1995, p. 6163-6169.

Each of these minimal modules is composed 
of an adenylation domain (A-domain) and a thiolation 
domain (T-domain).

Moreover, the two minimal modules of the
Asp-Phe dipeptide synthetases according to the
invention are connected by a so-called condensation
domain, which needs to be covalently bound to the
polypeptide chain of the Phe-module, namely to the N-
terminal part thereof. The condensation domain,
however, does not need to be bound covalently to both
minimal modules (i.e. to the modules recognising
respectively L-Asp and L-Phe) because there is no
requirement that these two minimal modules are located
on a single polypeptide chain. The term "connected"
therefore means that the condensation domain ensures
that both minimal modules can operate concertedly.

In known peptide synthetases the 
condensation domains occur as a moderately conserved 
region. The condensation domain (conserved region) 
usually consists of about 400 amino acid residues and 
is known to be involved in the catalysis of non-
ribosomal peptide formation. One of the conserved core
motifs contains the catalytically active histidine
residue. See T. Stachelhaus et al., J. Biol. Chem. 273,
1998, p. 22773-22781.

The Asp-Phe formed can be recovered from 
the reaction medium by any method available to the 
skilled man. 

It is preferred that the condensation
domain in the dipeptide synthetase is connected to both
minimal modules in such a way that it is also covalently
bound to the module recognising L-Asp. In such case the
condensation domain is not only bound covalently to the
N-terminal end of the L-Phe recognising module, but
also to the C-terminal end of the L-Asp recognising
module, and forms part of a single polypeptide chain
comprising the L-Asp and L-Phe recognising modules.
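
The architectural requirements described above can be written down as a simple check on a domain layout. The Python sketch below is only a way of restating the text: the representation (a synthetase as a list of polypeptide chains, each chain a list of (domain, substrate) tuples from N- to C-terminus) and all function names are assumptions made for this illustration.

```python
def find_minimal_module(chains, substrate):
    """Return (chain index, position of the A-domain) of the minimal module
    (A-domain directly followed by T-domain) recognising `substrate`."""
    for chain_index, chain in enumerate(chains):
        for pos in range(len(chain) - 1):
            if chain[pos] == ("A", substrate) and chain[pos + 1][0] == "T":
                return chain_index, pos
    return None


def is_asp_phe_synthetase(chains):
    """Check the minimum requirements: an Asp and a Phe minimal module, with a
    condensation (C) domain covalently bound to the N-terminus of the Phe module."""
    asp = find_minimal_module(chains, "Asp")
    phe = find_minimal_module(chains, "Phe")
    if asp is None or phe is None:
        return False
    phe_chain, phe_pos = phe
    if phe_pos == 0 or chains[phe_chain][phe_pos - 1][0] != "C":
        return False
    # On a shared chain the Asp module has to lie N-terminal to the Phe module.
    return not (asp[0] == phe_chain and asp[1] > phe_pos)


def is_preferred_single_chain(chains):
    """Preferred embodiment: A(Asp)-T-C-A(Phe)-T contiguous on one chain."""
    if not is_asp_phe_synthetase(chains):
        return False
    asp = find_minimal_module(chains, "Asp")
    phe = find_minimal_module(chains, "Phe")
    return asp[0] == phe[0] and asp[1] + 3 == phe[1]


if __name__ == "__main__":
    single = [[("A", "Asp"), ("T", None), ("C", None),
               ("A", "Phe"), ("T", None), ("TE", None)]]
    print(is_asp_phe_synthetase(single), is_preferred_single_chain(single))
```

A layout with the Asp module on one chain and C-A(Phe)-T on a second chain passes the first check but not the second, which mirrors the distinction the text draws between "connected" and "covalently bound to both modules".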
A distinguishing feature of non-ribosomal
peptide synthesis is the fact that the peptide formed
on the template is covalently bound to the T-domain of
the C-terminal module as a peptidyl-(4'-PP)-T-domain
intermediate. Release of the peptide from this
intermediate is assumed to take place either by
intermolecular attack by water, resulting in net
hydrolysis, or by intramolecular capture by a hydroxyl
or amino group of the peptide chain itself, giving rise
to a cyclic peptide product and to the peptide
synthetase in the holo form. The first termination
route yields a linear peptide with a free C-terminal
carboxylic group (as should be the case for the Asp-Phe
synthesis according to the present invention).

Because the Asp-Phe is present in an
intermediate form bound to the template of the Asp-Phe
dipeptide synthetase, it is advantageous to take
additional measures for enhancing the release of the
Asp-Phe from said template.

It is therefore particularly preferred that
also a releasing factor is present for the Asp-Phe
formed on the dipeptide synthetase. The term "releasing
factor" as used here is intended to comprise any means,
whether part of the synthetase or present in combination
therewith, which enhances the releasing from the
synthetase of the Asp-Phe formed on the synthetase.

All known bacterial and some fungal peptide
synthetase modules that incorporate the last amino acid
into the growing peptide chain show a region with a
thioesterase-like function. These regions of
approximately 250 amino acids are located at the C-
terminal end of the amino acid recognising modules.
These thioesterase-like regions are integrated regions
which exhibit homology to thioesterase-like proteins,
and therefore also are referred to as the thioesterase
domain ((integrated) TE-domain). All these integrated
TE-domains contain an active site serine residue, which
is part of the core motif GxSxG (see table 1).

Recent work has given support for the
theory that these integrated TE-domains are involved in
the termination of the chain elongation and the product
release. For instance, deletion of the complete TE-
domain from the surfactin synthase led to a 97%
reduction of the in vivo surfactin production
(Schneider A., et al., Arch. Microbiol., 169, 1998,
p. 404-410). Furthermore, it has been shown that
relocating the integrated TE-domain from the C-terminus
of module 7 of the surfactin synthase to the C-terminal
ends of modules 4 and 5 resulted in the formation of
the corresponding lipotetra- and pentapeptide. Also in
this study the removal of the integrated TE-domain led
to an almost complete reduction of peptide synthesis
(Ferra F. de, et al., J. Biol. Chem., 272, 1997,
p. 25304-25309).

In a particularly preferred embodiment of
the invention the releasing factor therefore is a
protein which shows thioesterase-like functions and
forms an integrated domain of the dipeptide synthetase
at the C-terminus thereof.

In addition it is preferred that the Asp-
Phe dipeptide synthetase, prior to the production of
Asp-Phe, has undergone optimisation for its function by
using one or more post-translational modifying
activities. This is useful for achieving the most
efficient non-ribosomal synthesis of Asp-Phe.

The term "post-translational modifying
activities" for efficient non-ribosomal synthesis of
Asp-Phe as used here is intended to comprise any
activities which modify the dipeptide synthetase after
its formation thereby positively affecting its Asp-Phe
synthesising function.

In particular, in the production of Asp-Phe
according to the present invention the post-
translational modifying activity used is a 4'-
phosphopantetheinyl (4'-PP) transferase. The 4'-PP
transferase provides for effective conversion of the
apo- to holo-enzyme of the peptide synthetase by
loading the 4'-PP cofactor to the serine side-chains in
the core motif of the T-domains, and thereby increases
the yield of Asp-Phe in the production thereof.
Effective conversion of apo- to holo-enzyme is provided
if in each of both T-domains of the Asp-Phe dipeptide
synthetase at least 10% of the apo-enzyme is converted
to the holo-form.

It is particularly preferred that in the
production of Asp-Phe according to the invention also a
non-integrated protein with thioesterase Type-II-like
activity is present together with the dipeptide
synthetase. As meant herein proteins having
thioesterase Type-II-like activity are proteins with
strong sequence similarities to type-II fatty acid
thioesterases of vertebrate origin. Such non-integrated
protein with thioesterase Type-II-like activity is
different from the integrated thioesterase (TE-domain).
Recent work (Schneider et al., Arch. Microbiol. (1998),
169, 404-410) has shown that deletion of a gene
encoding such non-integrated protein with thioesterase
Type-II-like activity from the surfactin synthase
operon leads to an 84% reduction of peptide production.
It is suggested that the non-integrated protein with
thioesterase Type-II-like activity enhances production
of non-ribosomal peptides, possibly by reactivation
through liberation of mischarged modules that are
blocked with an incorrect aminoacyl group or an
undesired acyl group at the 4'-PP cofactor.

The genes coding for the non-integrated
proteins with thioesterase Type-II-like activity can be
positioned at the 5'- or 3'-end of the peptide
synthetase encoding operon. These proteins have
molecular masses of 25-29 kDa, are about 220-340 amino
acid residues in length, and carry the sequence GxSxG
which is presumed to form the active site. It is
noticed that in almost all of the prokaryotic peptide
synthetase coding operons known so far, such distinct
genes have been detected.

In the production of Asp-Phe according to
the present invention the Asp-Phe dipeptide synthetase
is preferably present in living cell-material of a
micro-organism, and glucose, L-Asp and/or L-Phe are
being fed to the fermentor, and the Asp-Phe formed is
then recovered. As used in this context the term
"glucose" is intended to cover glucose and any other
energy source necessary for the regeneration of ATP in
the living cell material and for the maintenance
energy required for said living cell material. The
glucose (or other energy source), moreover, is used as
starting material for the production of any L-Asp
and/or L-Phe to be produced in the living cell in the
course of the process of the invention.

The skilled man, of course, will be aware
that the feeding of glucose, L-Asp and/or L-Phe is to
be done under appropriate conditions of temperature and
pH, including as required the presence of an
appropriate nitrogen source, salts, trace elements, and
other organic growth factors such as vitamins and amino
acids, etc. to the fermentor or other type of (enzyme)
reactor which is used for the production of Asp-Phe.
The Asp-Phe formed is recovered. Such recovery may take
place during the process or at the end thereof.

The living cell-material may be present in
any appropriate form as available to the skilled man.
For instance, whole cells may be used as such or in
immobilised form. The micro-organism may be any kind of
micro-organism wherein the Asp-Phe dipeptide
synthetases according to the invention can stably be
expressed. Suitable micro-organisms are, for instance,
micro-organisms which

(a) are producing peptides via non-ribosomal synthesis,
for instance, bacteria such as Streptomyces species,
Bacillus species, Actinomyces species, Micrococcus
species, Nocardia species, or fungal species such as
Tolypocladium species, Fusarium species, Penicillium
species, Aspergillus species, and Cochliobolus species;

or

(b) are capable of producing amino acids, in particular
L-Asp and/or L-Phe, preferably on industrial scale, for
instance, Escherichia species, e.g. E. coli, and
Corynebacterium species, e.g. C. glutamicum.
The micro-organisms may be grown, under
conditions which can easily be found by the skilled
man, in a fermentor, and production of the Asp-Phe then
can be carried out in the same or in another fermentor.
As meant herein the fermentor may be any type of
fermentor or other types of (enzyme) reactor known to
the skilled man.

In the method for the production of Asp-Phe
according to the present invention it is preferred that
the micro-organism is first grown in a fermentor to
reach a predetermined cell density before the
expression of the Asp-Phe dipeptide synthetase is
switched on and feeding of the glucose, L-Asp and/or L-
Phe for the synthesis of the Asp-Phe dipeptide is
started.

The skilled man can easily determine the
growth of the micro-organism, e.g. by measuring its
optical density (O.D.), and find the most appropriate
level of cell density. To prevent any negative effect
on the growth of the micro-organism, growth phase and
Asp-Phe synthetase production phase are preferably
uncoupled. Such uncoupling can be achieved by
expressing the gene for the Asp-Phe synthetase from an
inducible, tightly regulable, promoter. The expression
of the Asp-Phe dipeptide synthetase is preferably
switched on by addition of a specific chemical
component (inducer) or by changing the physical
conditions, e.g. the temperature, pH or dissolved
oxygen pressure, after a predetermined level of cell
density has been reached. The expression is assumed to
be switched on as compared to the non-induced state, if
the expression level of the Asp-Phe dipeptide
synthetase is raised at least by a factor of 10.

Then also the feeding of substrates, etc. 
in amounts as required, is started, and production of 
Asp-Phe starts. 

Most preferably the micro-organism is an L-
phenylalanine producing micro-organism and, apart from
required amounts of salts and trace elements etc., only
glucose and L-Asp are being fed. L-Phe producing micro-
organisms are well-known. For instance, E. coli and
Corynebacterium species are being used for L-Phe
production. By expressing an Asp-Phe dipeptide
synthetase in such micro-organisms availability of L-
Phe in the micro-organism is provided for, and only
glucose, an appropriate nitrogen source, organic growth
factors, salts and trace elements, etc., as well as L-
Asp, should be supplied as required.

In particular the micro-organism used is an 
Escherichia or a Bacillus species. 

Best results are obtained if the micro-
organism used is a strain with reduced protease
activity for Asp-Phe or lacking such activity towards
Asp-Phe. By using such strains degradation of Asp-Phe
formed is prevented. Any suitable strain which lacks
protease activity (either naturally or because the
activity of the protease has been lowered substantially
or has been removed completely) may be used.



Moreover, if the synthesis of Asp-Phe
occurs in a micro-organism, also additional measures
can be taken for improving the permeation of the Asp-
Phe formed into the reaction medium outside the micro-
organism and recovering the Asp-Phe therefrom after
separation of the micro-organism from the reaction
medium. Similarly, also additional measures may be
taken for improving the uptake of glucose and/or L-Asp
and/or L-Phe.

In an even more preferred embodiment of the
invention the micro-organism used also contains a
suitable export system for Asp-Phe formed and/or one or
more suitable uptake system(s) for glucose and/or L-Asp
and/or L-Phe. Using a suitable export system will
ensure achieving more efficient secretion of the Asp-
Phe formed. The secretion meant here is the secretion
of Asp-Phe formed in the micro-organism into the
extracellular environment. Efficient secretion of Asp-
Phe is important for improving the recovery yield of
Asp-Phe and for maintaining the activity of the Asp-Phe
dipeptide synthetase at a suitable level as well as for
preventing intracellular degradation of Asp-Phe.
Moreover, the down-stream processing for Asp-Phe
secreted is easier.

Similarly, the presence of suitable uptake
system(s) will improve the coupling efficiency to Asp-
Phe.

In the foregoing paragraphs in vivo non-ribosomal
synthesis methods for Asp-Phe have been
described. They all are characterised in that living
cell material is used and ATP regeneration takes place
in this cell material. The present invention can also
be carried out in vitro. As used herein, in vitro
systems are characterised in that the Asp-Phe dipeptide
synthetase is not present in living cell material; it
may, however, be present in any other environment, for
instance in permeabilised cells, cell-free extract, or
as an isolated dipeptide synthetase. In such case
regeneration of ATP does not take place in living cell
material used for the synthesis of Asp-Phe, and special
measures for supply of ATP in an effective amount have
to be taken.

In a preferred embodiment of the invention,
the production of Asp-Phe is carried out in vitro in an
enzyme reactor, while ATP is supplied, and L-Asp and/or
L-Phe are being fed, and the Asp-Phe formed is
recovered.

In order to improve the economic
feasibility of the process it is particularly preferred
to increase the yield of Asp-Phe per mole of ATP
supplied for the synthesis of Asp-Phe. This can be
achieved by in situ ATP regeneration from the AMP
formed out of the ATP in the consecutive adenylation
and thiolation reactions.

Therefore, in a preferred mode of the
invention, the supply of ATP is provided in part by an
in situ ATP-regenerating system.

Various ATP regenerating systems (which in
the literature are also being referred to as ATP
generating systems) are known to the skilled man. As
ATP regenerating systems either whole cell systems (e.g.
yeast glycolysis systems) or isolated ATP regenerating
enzymes, for instance adenylate kinase combined with
acetate kinase, may be used. A very elegant ATP
regeneration system has been described by T. Fujio et
al. (Biosci., Biotechnol., Biochem. 61, 1997, p. 840-845).
They have shown the use of permeabilised
Corynebacterium ammoniagenes cells for regeneration of
ATP from the corresponding monophosphate (AMP) coupled
to an ATP-requiring reaction in permeabilised E. coli
cells. In this elegant way (cheap) glucose can be
supplied as an energy source instead of most of the
ATP.

Therefore, the ATP-regenerating system is
preferably present in a permeabilised micro-organism.
This permeabilised micro-organism present in the
(enzyme) reactor used ensures that an effective amount
of adenosine-triphosphate (ATP) will always be present
and available during the Asp-Phe production according
to the present invention.
DNA FRAGMENTS ENCODING AN Asp-Phe DIPEPTIDE SYNTHETASE

The present invention also relates to novel 
DNA fragments encoding an Asp-Phe dipeptide synthetase. 
These novel DNA fragments or combination of
DNA fragments code for a non-ribosomal Asp-Phe
dipeptide synthetase, which synthetase comprises two
minimal modules connected by one condensation domain
wherein the N-terminal module of these modules is
recognising L-aspartic acid and the C-terminal module
of these modules is recognising L-phenylalanine and is
covalently bound at its N-terminal end to the
condensation domain, and wherein each of these minimal
modules is composed of an adenylation domain and a
4'-phosphopantetheinyl cofactor containing thiolation
domain.

The term "DNA fragment or combination of DNA
fragments" as used herein is understood to have its
broadest possible meaning. The term first of all relates
to the composite biological material (on one or more
DNA fragments) as mentioned herein-above and coding for
the minimal modules for Asp and Phe in the correct order
and for the condensation domain, each coding sequence
being surrounded by any transcription and translation
control sequences (e.g. promoters, transcription
terminators) and the like which may be suitable for the
expression of the Asp-Phe dipeptide synthesising
activity. The control sequences may be homologous or
heterologous, and the promoter(s) present in the DNA may
be constitutive or inducible.



The term "DNA fragment" as used herein is
further understood to code, in addition to coding for
the Asp and Phe minimal modules and the condensation
domain, for the activities of the other domains, e.g.
TE-domains. Furthermore, these fragments may code for
activities which are not located on the Asp-Phe
dipeptide synthetase polypeptide itself, such as
non-integrated thioesterase Type-II-like proteins, and other
activities co-operating concertedly with the Asp and Phe
minimal modules.

The term "DNA fragment" as used herein is
also understood to comprise gene structures comprising
DNA fragments as described herein-above. More
precisely, a gene structure is to be understood as
being a gene and any other nucleotide sequence which
carries the DNA fragments according to the invention.
Appropriate nucleotide sequences can, for example, be
plasmids, vectors, chromosomes, or phages. The gene
structures may exist either as (part of) an
autonomously replicating vector in single or multicopy
situation, or integrated into the chromosome in single
or multicopy situation.

The gene structure is also to be understood
as being a combination of the above-mentioned gene
carriers, such as vectors, chromosomes and phages, on
which the DNA fragments according to the invention are
distributed. For example, the Asp-Phe dipeptide
synthetase encoding DNA fragment can be introduced into
the cell on a vector and the non-integrated thioesterase
Type-II-like protein encoding DNA fragment can be
inserted into the chromosome. In addition, a further
DNA fragment can, for example, be introduced into the
cell using a phage. These examples are not intended to
exclude other combinations of DNA fragment
distributions from the invention. The DNA fragments
according to the invention may be introduced into the
micro-organism at a sufficiently high copy number, for
instance of up to 50 copies.

A detailed discussion of the Asp-Phe
dipeptide synthetase and the two minimal modules
comprised therein has already been given in the
preceding parts of this patent application.

The construction of the DNA fragments
according to the present invention is not self-evident,
although terms like "module", "domains", etc.,
misleadingly might suggest that much is known about the
functional boundaries thereof. Such detailed information
which would enable rational design of (mutant, non-natural)
peptide synthetases, however, is not yet
available. Nevertheless, recently various techniques for
construction of mutant peptide synthetases have been
described in literature. Methods for construction of
mutant peptide synthetases described in literature are:

De Ferra et al. (J. Biol. Chem., 272, 1997,
p. 25304-25309) have described a method for producing
truncated peptides of a predicted sequence by
replacement of the integral TE-domain of the surfactin
synthetase from the C-terminal module to the C-terminal
end of different internal modules. This technique alone,
however, is not suitable for the construction of an
Asp-Phe dipeptide synthetase module because the natural
order of two consecutive Asp and Phe modules is not
known to exist in nature (neither at the N- or C-terminal
end of any natural synthetase, nor as an
internal sequence of any naturally occurring
synthetase).

Another in vivo technique described for
engineering peptide synthetases is the so-called
programmed alteration within the primary structure of a
peptide product. Basis for this method is the
replacement of one module by another on the genetic
level. This technique has been described in general by
A. Schneider et al. (Mol. Gen. Genet., 257, 1998,
p. 308-316). In this way amino acid-activating minimal
modules could be exchanged successfully in vivo between
multi-modular peptide synthetases of heterologous
origin, and micro-organisms could be obtained which
indeed produce non-ribosomal peptides with a different
primary structure from the peptides produced without
such alteration.

If this technique were applied for the
construction of an Asp-Phe dipeptide synthetase, in
principle two options would be available, each starting
from a dipeptide synthetase, namely having a sequence
of two modules comprising either (i) Asp and XXX, the
latter representing any other amino acid than Phe, or
(ii) YYY and Phe, the former representing any other
amino acid than Asp. In those dipeptide synthetases the
DNA coding for the XXX module should be replaced by DNA
coding for the Phe module, or the DNA coding for the
YYY module by DNA coding for the Asp module.

Other methods for construction of peptide
synthetases have been described in EP-A-0637630. In
said patent application a method is suggested whereby,
next to alteration of the substrate specificity by
substitution of (part of) modules, also modules can be
deleted or inserted into the synthetase chain.
"Specificity" of a module means that the module has a
certain preference in recognising one amino acid above
other amino acids or above another amino acid.

It is a distinctive feature of the above
peptide synthetase engineering methods that homologous
recombination events are used to bring about the
desired changes in the genomic DNA in the native
peptide producing micro-organism. Because the
homologous recombination events take place in the
native peptide producing micro-organism, these methods
would have the advantage that all the native host
enzymes and relevant regulatory elements are present in
principle.

However, homologous recombination through
use of the native non-ribosomal peptide producer
suffers from several severe drawbacks. The most serious
of these drawbacks is that it is often tedious and
technically difficult, especially when applied to
slow-growing micro-organisms with poorly developed
transformation systems or which are lacking in other
genetic tools. Other drawbacks are that the native
non-ribosomal peptide producing micro-organisms often do not
have a history of safe use on industrial scale, are no
production organisms for L-Phe and/or L-Asp, and have
unknown fermentation characteristics. Moreover, all
these methods result in cells having only a single copy
of the DNA fragment coding for the desired peptide
synthetase. Therefore, none of these in vivo engineering
methods are suitable for the preparation of the novel
Asp-Phe dipeptide synthetase according to the invention
and use thereof for the industrial production of
Asp-Phe.

The present inventors now have found that 
the Asp-Phe dipeptide synthetase can be readily obtained 
by use of in vitro engineering techniques. So far no in 
vitro engineering techniques for the construction of 
peptide synthetases have been described. Detailed 
protocols for the construction of Asp-Phe dipeptide 
synthetases according to the invention can be found in 
the experimental part of this application. 

The Asp-Phe dipeptide synthetase encoding
DNA fragment can be constructed in vitro from an Asp-XXX
or YYY-Phe (with XXX and YYY having the meaning as
described above) dipeptide synthetase encoding DNA
fragment (or partial sequences for such synthetase
encoding fragments as occurring in a naturally existing
peptide synthetase). This was accomplished starting from
an Asp-Leu (Leu = leucine) dimodular peptide synthetase
encoding DNA fragment which was obtained from the
Bacillus subtilis ATCC 21332 surfactin synthetase A gene
(srfA-B) by PCR method. The Leu minimal module encoding
DNA fragment thereof then was replaced by a DNA fragment
(obtained by PCR method) from the Bacillus brevis ATCC
8185 tyrocidine A synthetase gene coding for a Phe
minimal module (tycA). Then an integral TE-domain was
added to the C-terminal end of the Asp-Phe encoding DNA
fragment by replacement of the thiolation domain of the
Phe module by a PCR-fragment coding for the srfA-C
thiolation and TE domain. This construction was done in
such a way that the DNA encoding the additional TE
domain was fused in-frame with the gene encoding the
Asp-Phe synthetase. As a result the TE-domain forms an
integrated part of the Asp-Phe synthetase produced. In
the experimental part of this application this TE-domain
containing Asp-Phe synthetase will be referred to as
Asp-Phe-TE.
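
The construction steps described above can be pictured as operations on the linear domain order of the synthetase. The following minimal Python sketch is purely illustrative: the domain labels (A = adenylation, T = thiolation, C = condensation, TE = thioesterase) follow the text, whereas the list representation and the function names are assumptions introduced here and are not part of the described cloning procedure.

```python
# Illustrative model only: each domain is a (type, origin) tuple.
ASP_LEU = [
    ("A", "Asp"), ("T", "Asp"),   # Asp minimal module (from srfA-B)
    ("C", "srfA-B"),              # condensation domain linking the modules
    ("A", "Leu"), ("T", "Leu"),   # Leu minimal module (from srfA-B)
]


def swap_second_module(domains, new_label):
    """Replace the A and T domains following the C domain by a new minimal module."""
    c_index = next(i for i, (kind, _) in enumerate(domains) if kind == "C")
    return domains[: c_index + 1] + [("A", new_label), ("T", new_label)]


def add_te(domains, te_origin):
    """Replace the last T domain by a T domain plus an integrated TE domain."""
    if domains[-1][0] != "T":
        raise ValueError("last domain must be a thiolation domain")
    return domains[:-1] + [("T", te_origin), ("TE", te_origin)]


def describe(domains):
    """Render the domain order as a readable string."""
    return "-".join(f"{kind}({origin})" for kind, origin in domains)


asp_phe = swap_second_module(ASP_LEU, "Phe")     # Leu module replaced by Phe module
asp_phe_te = add_te(asp_phe, "srfA-C")           # T of Phe module replaced by T+TE
print(describe(asp_phe))
print(describe(asp_phe_te))
```

The sketch only tracks the order of domains; it says nothing about the fusion sites, which are determined by the restriction sites used in the experimental part.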

After the construction, the encoding DNA
fragments were introduced into a suitable host
micro-organism. Suitable host micro-organisms are, for
instance, E. coli and Bacillus species. After
cultivation of these micro-organisms under inducing
conditions, cells were lysed and the synthetases
produced were purified by IMAC (Immobilised Metal
Affinity Chromatography). The purified enzyme
preparations were used for different experiments to
prove the formation of Asp-Phe.


Preferred DNA fragments 

The following preferred aspects of the DNA
fragments encoding the Asp-Phe dipeptide synthetase
according to the invention closely correspond to the
aspects discussed in the previous parts of this
specification regarding the preferred methods for the
production of Asp-Phe.

For the DNA fragments encoding the Asp-Phe
dipeptide synthetase according to the invention it is
especially preferred that the condensation domain in
the encoded dipeptide synthetase is connected to both
minimal modules in such way that it is also covalently
bound to the module recognising L-aspartic acid.

In particular it is preferred that the DNA
fragment or combination of DNA fragments encoding the
dipeptide synthetase also code for a releasing factor
for the Asp-Phe formed on that dipeptide synthetase.
The term "releasing factor" is used in the same meaning
as it has been used in the previous part of the
specification.

In a more particularly preferred embodiment
of the present invention, the DNA fragment or
combination of DNA fragments encoding the Asp-Phe
dipeptide synthetase is/are also coding for a protein
which shows thioesterase-like functions and forms an
integrated domain of the dipeptide synthetase at the
C-terminus thereof. For an explanation of the terms
"integrated domain" etc., reference is made to earlier
parts of the present application.

In addition, the synthetase encoding DNA
fragment or combination of DNA fragments preferably
also express(es) one or more post-translational
modifying activities for efficient non-ribosomal
synthesis of Asp-Phe on the synthetase. The terms
"post-translational modifying activities", etc. are
used in the same meaning as they have been used in the
previous part of the specification.

In particular, the post-translational
modifying activity expressed by the DNA fragment or
combination of DNA fragments is a
4'-phosphopantetheinyl (4'-PP) transferase activity. The
formation of this activity provides for effective
conversion of apo- to holo-enzyme. Effective conversion
of apo- to holo-enzyme, etc. already has been explained
in the previous part of the specification.

It is particularly preferred that the DNA
fragment or combination of DNA fragments also code(s)
for a non-integrated protein with thioesterase
Type-II-like activity. The term "non-integrated protein with
thioesterase Type-II-like activity" is used in the same
meaning as it has been used in the previous part of the
specification.



Micro-organisms

The invention further relates to micro-organisms
containing a DNA fragment or combination of
DNA fragments according to the invention, and in
particular to such micro-organisms which are capable of
producing L-Asp and/or L-Phe. In particular, the
micro-organism is an Escherichia coli or Bacillus species.
Asp-Phe dipeptide synthetases

The present invention finally also relates
to novel Asp-Phe dipeptide synthetases. The terms and
expressions used hereinafter with respect to the Asp-Phe
dipeptide synthetase all have the same meaning as
explained herein-above.

The non-ribosomal Asp-Phe dipeptide
synthetases according to the present invention are
characterised in that they comprise two minimal modules
connected by one condensation domain wherein the
N-terminal module of these modules is recognising
L-aspartic acid and the C-terminal module of these
modules is recognising L-phenylalanine and is
covalently bound at its N-terminal end to the
condensation domain, and wherein each of these minimal
modules is composed of an adenylation domain and a
4'-phosphopantetheinyl cofactor containing thiolation
domain.

In particular, the condensation domain in
the dipeptide synthetases is connected to both minimal
modules in such way that it is also covalently bound to
the module recognising L-aspartic acid.

Preferably, the Asp-Phe dipeptide
synthetase also comprises a releasing factor for the
Asp-Phe formed on that dipeptide synthetase.

Most preferably, the releasing factor is a
protein which shows thioesterase-like functions and
forms an integrated domain of the dipeptide synthetase
at its C-terminus.

The invention hereinafter now will be
clarified further in the experimental part, but will in
no way be restricted to the experiments shown.

EXPERIMENTAL PART 

General procedures 

Standard molecular cloning techniques such
as DNA isolation, gel electrophoresis, enzymatic
restriction modifications of nucleic acids, E. coli
transformation etc., were performed as described by
Sambrook et al., 1989, "Molecular Cloning: a laboratory
manual", Cold Spring Harbor Laboratories, Cold Spring
Harbor, New York and Innis et al., 1990, "PCR
protocols, a guide to methods and applications",
Academic Press, San Diego. Synthetic
oligodeoxynucleotides were obtained from MWG-Biotech AG,
Ebersberg. DNA sequence analyses were performed on an
Applied Biosystems ABI 310 genetic analyzer, according
to supplier's instructions. Sequencing reactions were
carried out by the chain termination method with
dye-labelled dideoxy terminators from the PRISM Ready
Reaction DyeDeoxy Terminator cycle sequencing kit with
AmpliTaq FS polymerase (Applied Biosystems).

Construction of plasmid pasp-leu-His6

A 4934 bp fragment comprising regions from
the srfB locus from chromosomal Bacillus subtilis ATCC
21332 DNA was amplified (PCR) using the following
primers:

5' TAA GCA TGC TGC TTT CAT CTG CAG AAA C (5' asp-leu-SphI-srfB2), and
3' AAT GGA TCC TTC GGC ACG CTC TAG (3' asp-leu-BamHI-srfB3).
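
As a quick plausibility check on the primer design, one can verify that each primer carries the restriction site named in its label. The Python sketch below is a hedged illustration rather than part of the original procedure: the primer sequences are taken from the text, the recognition sequences (SphI: GCATGC, BamHI: GGATCC) are the standard ones for these enzymes, and the helper name `site_position` is an assumption introduced here.

```python
# Standard recognition sequences (written 5'->3') for the two enzymes used.
SITES = {"SphI": "GCATGC", "BamHI": "GGATCC"}

# Primer sequences as printed in the text (spaces are only for readability).
PRIMERS = {
    "asp-leu-SphI-srfB2": "TAA GCA TGC TGC TTT CAT CTG CAG AAA C",
    "asp-leu-BamHI-srfB3": "AAT GGA TCC TTC GGC ACG CTC TAG",
}


def site_position(primer, site):
    """Return the 0-based index of the site within the primer, or -1 if absent."""
    return primer.replace(" ", "").upper().find(site)


for name, seq in PRIMERS.items():
    for enzyme, site in SITES.items():
        pos = site_position(seq, site)
        if pos >= 0:
            print(f"{name}: {enzyme} site {site} found at index {pos}")
```

In both primers the site sits behind a short 5' overhang of three nucleotides, which is the usual arrangement for allowing the enzyme to cut close to the end of a PCR product.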

Correct size of the amplified fragment was
confirmed by agarose gel electrophoresis.

The fragment (20 µg) was digested with 1
unit of the enzymes BamHI/SphI (37°C, 15 h) to generate
terminal restriction sites.

Plasmid pQE70 (provided by Qiagen, D-Hilden)
(10 µg) was digested with the same enzymes and
subsequently incubated for 1 hour with 1 unit Alkaline
Phosphatase (37°C). Complete digestion was confirmed by
transforming 1 µL of the linearised plasmid DNA into
competent cells of E. coli XL1 blue. The two fragments
were subsequently ligated in a ligation reaction (10 µL)
in a vector/insert ratio of 1:3 with 1 unit of T4-DNA-ligase
enzyme (16°C, 16 h).
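
The vector/insert ratio of 1:3 is a molar ratio, so the corresponding mass of insert depends on the lengths of the two fragments. The following Python sketch shows the conversion under stated assumptions: the 4934 bp insert length comes from the text, while the vector length (about 3400 bp) and the 50 ng of vector are hypothetical values chosen only for illustration, and the function name is introduced here.

```python
def insert_mass_ng(vector_ng, vector_bp, insert_bp, insert_per_vector=3.0):
    """Mass of insert (ng) that gives the requested insert:vector molar ratio.

    For double-stranded DNA the molar amount is proportional to mass / length,
    so the insert mass scales with the length ratio of insert to vector.
    """
    if vector_bp <= 0 or insert_bp <= 0:
        raise ValueError("fragment lengths must be positive")
    return vector_ng * (insert_bp / vector_bp) * insert_per_vector


# Hypothetical example: 50 ng of an assumed ~3400 bp vector, 4934 bp insert, ratio 1:3.
print(round(insert_mass_ng(50, 3400, 4934), 1), "ng insert")
```

Because the insert is longer than the assumed vector, a 1:3 molar ratio corresponds to considerably more than three times the vector mass.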

1 µL of the ligation mixture was used to transform 40 µL
competent cells of E. coli XL1 blue (Stratagene,
D-Heidelberg) by electroporation. The transformants were
selected on 2x YT agar plates containing Ampicillin (100
µg/mL). Analysis of 48 transformants resistant to
ampicillin revealed that 4 of them had inserted a ca.
5000 bp fragment. Correct insertion was confirmed using
restriction enzyme digestion analysis and terminal
sequencing of the insert. A correct clone designated
pasp-leu-His6 was used for further investigations.



Construction of plasmid pasp-phe-His6

Plasmid pasp-phe-His6 was constructed from
plasmid pasp-leu-His6 as follows.

A 1894 bp chromosomal DNA fragment from
Bacillus brevis ATCC 8185 DNA was amplified (PCR) using
the following primers:

5' ATT TGG TCA CCA ATC TCA TCG ACA A (5' BstEII-TycA-MLID), and
5' ATA GGA TCC TGT ATT CGT AAA GTT TTT C (3'-PheAT-BamHI).

Correct size of the fragment was confirmed
using agarose gel electrophoresis.

The fragment was digested with 1 unit of
enzyme BamHI and incubated at 30°C for 4 hours.
Subsequently 1 unit of enzyme BstEII was added and
incubated for another 4 hours at 60°C.
Plasmid pasp-leu-His6 was digested in the
same way and subsequently incubated for 1 hour with 1
unit of Alkaline phosphatase. The vector portion (ca.
6,5 kb) was separated from other DNA fragments by
agarose gel electrophoresis and repurified. Complete
digestion was confirmed as before with linearised
pasp-leu-His6. The two fragments were ligated in an equimolar
ratio for 5 hours at 16°C using 1 unit of T4-ligase
enzyme. 1 µL of the ligation mixture was used for
electroporation of E. coli XL1 blue competent cells.
Transformants were selected on 2x YT agar containing
Ampicillin (100 µg/mL). Analysis of transformants
revealed that 1 out of 90 clones had inserted a fragment
of ca. 2000 bp. Correct insertion was confirmed using
restriction enzyme digestion analysis and terminal
sequencing of the insert.

The correct clone was designated pasp-phe-His6.
In contrast to the peptide synthetase encoding
gene on plasmid pasp-leu-His6, the peptide synthetase
encoding gene on plasmid pasp-phe-His6 is a hybrid gene
obtained by exchanging the DNA fragment coding for the
second (Leu) minimal module (A- and T-domain) for a
DNA fragment coding for a Phe minimal module.
Construction of plasmid pasp-phe-TE-His6

Plasmid pasp-phe-TE-His6 was constructed
from plasmid pasp-phe-His6.

A 910 bp chromosomal DNA fragment from
Bacillus subtilis ATCC 21332 DNA was amplified (PCR)
using the following primers:

5' ATA ATC GAT AAT CGC ACA AAT ATG GTC (5' TE-srfC1-ClaI) and
3' ATA AGA TCT AAC AAC CGT TAC GGT TTG TGT (3' int TE-srfC1-BglII).

Correct size of the fragment was confirmed
using agarose gel electrophoresis.

The fragment was digested with 1 unit of
enzyme ClaI for 4 hours at 37°C, before adjusting buffer
conditions and digesting with 1 unit of enzyme BglII (4
hours, 37°C).

Plasmid pasp-phe-His6 was digested with enzyme ClaI (4
h, 37°C) and subsequently with BamHI (4 h, 37°C) before
the linearised plasmid was incubated for one hour with 1
unit of Alkaline phosphatase. The vector portion (ca. 8
kb) was separated from other DNA fragments by agarose
gel electrophoresis and repurified.

Control of complete digestion, ligation,
electroporation and selection of transformants were
carried out as described before.

Two of the analysed transformants were shown
to contain the desired DNA fragment. Correct insertion
of the 900 bp fragment was confirmed by restriction
enzyme analysis and terminal sequencing of the insert.

A correct clone was designated pasp-phe-TE-His6.
In contrast to the peptide synthetase encoding
gene on plasmid pasp-phe-His6, the peptide synthetase
encoding gene on plasmid pasp-phe-TE-His6 contains a
second fusion site located between the DNA coding for
the adenylation domain and thiolation domain of the
second (Phe) minimal module. The C-terminal T-TE domains
resemble the native C-terminus of the surfactin
synthetase srfC.




Expression of the peptide synthetases asp-leu-His6,
asp-phe-His6 and asp-phe-TE-His6

1 µL of each constructed plasmid was
transformed into E. coli BL21/pgsp competent cells. Strain
BL21 λDE3 was obtained from Stratagene, D-Heidelberg.
Plasmid pgsp, which is based on plasmid pREP4 (obtained
from Qiagen, D-Hilden), contains the gsp gene (the 4'-PP
transferase gene from the Gramicidin S biosynthesis
operon from Bacillus brevis ATCC 9999) under control of
the T7 promoter.

Transformants were selected on 2x YT agar
plates containing Ampicillin (100 µg/mL) and Kanamycin
(25 µg/mL). Several colonies were used to inoculate 4 mL
of 2x YT liquid medium (containing in addition 10 mM
MgCl2) and incubated at 37°C for 16 hours. These 4 mL
cultures were subsequently used to inoculate 400 mL of
the same medium. Cells were grown at 30°C in a waterbath
shaker (250 rpm). After 3-4 hours, the cells reached an
optical density of 0,7 (OD600nm) and were induced by the
addition of 200 µM IPTG. Cells were incubated for an
additional 1,5 hours before being harvested.

Expression of recombinant proteins was
confirmed by SDS-PAGE comparing protein samples taken at
the time of induction and 1,5 hours later.

In crude cell extracts from
BL21/pgsp/pasp-leu-His6 and BL21/pgsp/pasp-phe-His6 expression of an
inducible protein of ca. 180 kDa was confirmed. From
crude cell extracts of BL21/pgsp/pasp-phe-TE-His6
expression of an inducible protein of ca. 200 kDa could
be shown.

From cultures expressing the correct
recombinant proteins glycerol stocks were prepared and
stored at -80°C.

Purification of the recombinant proteins Asp-Leu-His6,
Asp-Phe-His6 and Asp-Phe-TE-His6

800 mL cultures of BL21/pgsp/pasp-leu-His6,
BL21/pgsp/pasp-phe-His6 and BL21/pgsp/pasp-phe-TE-His6,
treated as described in "Expression of the peptide
synthetases", were centrifuged at 5000 rpm for 5
minutes and resuspended in 30 mL/L culture of buffer A
(50 mM HEPES, 300 mM NaCl, pH 8,0). Cell suspensions
were used directly or were stored at -20°C till usage.
Cell lysis was achieved using two French press
passages at a working pressure of 12000 psi.

Directly after cell lysis PMSF was added to
a final concentration of 1 mM. After centrifugation of
the cell lysates at 10000 rpm for 30 minutes, the
supernatant was combined with 1% (v/v) buffer B (50 mM
HEPES, 300 mM NaCl, 250 mM imidazole, pH 8,0). Protein
solutions were applied on a Ni2+-NTA-agarose column
(Qiagen, D-Hilden) previously equilibrated with 1% (v/v)
buffer B. Flow rate was 0,75 mL/min. After the
non-His6-tagged proteins had passed through the column, it was
washed with 1% buffer B for another 10 min before a
linear gradient was applied (30 min to 30% B, an
additional 10 min to 100% B). All three proteins eluted
at a concentration of about 5% buffer B (15 mM imidazole)
and were collected as 2 mL fractions.
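
The two-segment gradient described above can be written as a simple piecewise-linear function of time. The Python sketch below is an illustration under assumptions, not a description of the chromatography software actually used: it assumes the gradient starts from the 1% buffer B wash level, takes buffer B as containing 250 mM imidazole (as read from the buffer composition), and assumes that buffer A contains no imidazole; the function names are introduced here.

```python
IMIDAZOLE_IN_B_MM = 250.0  # imidazole concentration of buffer B, as read from the text
START_PERCENT_B = 1.0      # assumed starting level (the wash step uses 1% buffer B)


def percent_b(t_min):
    """Percentage of buffer B at t_min minutes after the start of the gradient."""
    if t_min <= 0:
        return START_PERCENT_B
    if t_min <= 30:                      # first segment: 30 min to 30% B
        return START_PERCENT_B + (30.0 - START_PERCENT_B) * t_min / 30.0
    if t_min <= 40:                      # second segment: 10 more min to 100% B
        return 30.0 + 70.0 * (t_min - 30.0) / 10.0
    return 100.0


def imidazole_mm(pct_b):
    """Imidazole concentration (mM) of a mixture containing pct_b percent buffer B."""
    return IMIDAZOLE_IN_B_MM * pct_b / 100.0


for t in (0, 5, 30, 35, 40):
    b = percent_b(t)
    print(f"t = {t:2d} min: {b:5.1f}% B, {imidazole_mm(b):6.1f} mM imidazole")
```

Under these assumptions 5% buffer B corresponds to 12,5 mM imidazole, which is of the same order as the approximate elution value quoted in the text.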

Fractions containing the recombinant
proteins were detected using the Bradford reagent, by
the absorption at 595 nm. These fractions were pooled
and dialysed against a buffer containing 50 mM HEPES,
100 mM NaCl, 10 mM MgCl2 and 5 mM DTE for 16 hours.

After dialysis protein concentrations were again
determined.

Till further usage proteins were stored at
-20°C after addition of glycerol to 10% (v/v).

From 1 L culture approximately 5 mg of each
pure recombinant protein could be obtained. Grade of
purification was estimated to be 95% by SDS-PAGE.

Analysis of enzymatic activity

ATP-PPi exchange reaction:

Specificity of amino acid activation was
determined indirectly by incorporation of labelled 32PPi
into ATP during reverse reaction (Lee, S.G. & Lipmann,
F.; Tyrocidine synthetase system; Methods Enzymol. 43,
1975, p. 585-602). For this purpose 20 pmol of each
enzyme was incubated with 1 mM amino acid, 1 mM ATP, 0,1
mM PPi, 50 mM HEPES, 100 mM NaCl and 10 mM MgCl2 and 2
mCi 32PPi at 37°C in a total volume of 100 µL. Reactions
were quenched after 10 min by adding 500 µL of a
solution containing 100 mM NaPPi, 560 mM perchloric acid
and 1,2% (w/v) active charcoal. The mixture was
centrifuged at 13000 rpm for 1 min. The pellet was
washed and resuspended twice with 1 mL H2O.

Incorporation of labelled ATP (adsorbed to 
the charcoal) was detected by measuring radioactivity of 
the precipitate. 
Asp-Leu-His6 was shown to activate Asp and Leu
exclusively. KM values for Asp and ATP were determined to
be 3,5 mM and 0,9 mM respectively; values for Leu and
ATP were determined to be 0,3 mM and 0,6 mM, respectively.

Asp-Phe-His6 and Asp-Phe-TE-His6 were shown
to activate both Phe and Asp. The KM value for Phe was
determined to be about 50 µM.
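
To put the reported KM values in perspective, one can estimate how close each adenylation reaction would be to saturation at the 1 mM amino acid concentration used in the assay. The Python sketch below assumes simple Michaelis-Menten behaviour, which is an assumption made here for illustration (the text only reports the KM values); the function name is likewise introduced here. Decimal points are used in the code where the text uses decimal commas.

```python
def fractional_rate(substrate_mm, km_mm):
    """Fraction of the maximal rate, v/Vmax = S / (KM + S), for Michaelis-Menten kinetics."""
    if substrate_mm < 0 or km_mm <= 0:
        raise ValueError("substrate must be >= 0 and KM must be > 0")
    return substrate_mm / (km_mm + substrate_mm)


# KM values for the amino acids as reported in the text, converted to mM.
KM_MM = {"Asp": 3.5, "Leu": 0.3, "Phe": 0.05}

ASSAY_CONCENTRATION_MM = 1.0  # amino acid concentration in the exchange assay

for amino_acid, km in KM_MM.items():
    fraction = fractional_rate(ASSAY_CONCENTRATION_MM, km)
    print(f"{amino_acid}: KM = {km} mM, v/Vmax at 1 mM = {fraction:.2f}")
```

Under this assumption the Phe module would be close to saturation at 1 mM substrate, whereas the Asp module, with its higher KM, would operate well below its maximal rate.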

The amino acid activation patterns of
Asp-Phe-His6 and Asp-Phe-TE-His6 were found to be identical.

Covalent binding of amino acids to the Asp-Phe
dipeptide synthetase:

Quantity of holo-enzyme formation was
determined by measuring the amount of labelled Asp, Leu
and Phe that could be covalently bound to one
equivalent of the purified proteins Asp-Leu-His6,
Asp-Phe-His6 and Asp-Phe-TE-His6 (Lee, S.G.; see above).

50 pmol of each enzyme was incubated with 2
mM ATP, 50 mM HEPES, 100 mM NaCl, 10 mM MgCl2 and 100
pmol of 14C-labelled amino acid (Asp and Leu or Asp and
Phe respectively) at 37°C. After 30 min the reaction was
quenched by the addition of 1 mL of 10% TCA and 5 mg/mL
BSA and subsequently stored at 0°C for another 30 min.
The protein precipitates were collected by
centrifugation (30 min at 13000 rpm) and washed twice
with 10% TCA. The washed precipitates were dissolved in
50% performic acid and used to measure incorporation of
labelled amino acid.

Asp-Leu-His6 could be labelled with Asp and
Leu to a degree of approximately 20-25%. Asp-Phe-His6
and Asp-Phe-TE-His6 could be labelled with Asp to a
degree of only 10-15%. The incorporation of Phe reached
a level of approximately 50%.
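
The degree of labelling is the molar amount of amino acid bound per molar amount of enzyme. The Python sketch below shows how such a percentage could be derived from a scintillation count; it is a hedged illustration only. The 50 pmol of enzyme comes from the text, whereas the count (in dpm) and the specific activity of the labelled amino acid are hypothetical values, and the function names are introduced here.

```python
DPM_PER_NCI = 2220.0  # 1 nCi = 37 Bq = 2220 disintegrations per minute


def bound_pmol(dpm, specific_activity_nci_per_pmol):
    """Amount of labelled amino acid (pmol) corresponding to a measured count."""
    if specific_activity_nci_per_pmol <= 0:
        raise ValueError("specific activity must be positive")
    return dpm / DPM_PER_NCI / specific_activity_nci_per_pmol


def loading_percent(bound, enzyme_pmol):
    """Degree of labelling: bound amino acid per enzyme, in percent."""
    if enzyme_pmol <= 0:
        raise ValueError("enzyme amount must be positive")
    return 100.0 * bound / enzyme_pmol


# Hypothetical measurement: 11100 dpm at an assumed specific activity of 0.2 nCi/pmol.
bound = bound_pmol(11100, 0.2)
print(f"bound: {bound:.1f} pmol, loading: {loading_percent(bound, 50):.0f}%")
```

A loading below 100% indicates that only part of the enzyme carries an amino acid, for instance because conversion of apo- to holo-enzyme was incomplete.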

Formation of the dipeptides Asp-Leu and Asp-Phe:

Covalent binding of the constituent amino
acids to the dipeptide synthetases was shown by kinetic
experiments using radioactively labelled Asp. In a
first assay 0,85 µM Asp-Leu-His6 was incubated with 2
mM ATP, 2,8 µM Asp (14C, 56 nCi) in a buffer (50 mM
HEPES, 10 mM MgCl2, 100 mM NaCl, pH 8,0) at 37°C. At
regular time intervals samples were taken and treated
as indicated in the previous paragraphs; radioactivity
(of L-Asp covalently bound to the enzyme) was measured
in each sample. After 4 minutes, the amount of
incorporated labelled Asp started levelling off to
reach a maximum a few minutes later if no second amino
acid was added. If, however, 1,5 mM of Leu was added
after 4 minutes, a further strong, but temporary,
increase of radioactivity was observed, which sharply
decreased after about 5 minutes to a level below the
maximum observed when no second amino acid was added.

In another experiment 0,5 µM Asp-Phe-His6
were incubated in the same way as Asp-Leu-His6 before.
In this case the addition of 0,1 mM Phe after about 5
minutes resulted in a similar temporary increase of
covalently bound radioactive L-Asp, as described above.
However, the addition of Leu instead of Phe did not
lead to such a temporary increase.

This clearly shows that peptide bond 
formation takes place on each of these peptide 
synthetases. Furthermore, the results show that the 
peptides formed on the peptide synthetases are being 
released therefrom.




CLAIMS 

1. Method for the microbiological production of α-L-
aspartyl-L-phenylalanine (Asp-Phe) from the
substrates L-aspartic acid (L-Asp) and L-
phenylalanine (L-Phe) characterised in that the
substrates are contacted, in the presence of an
effective amount of adenosine-triphosphate (ATP),
with a non-ribosomal dipeptide synthetase
comprising two minimal modules connected by one
condensation domain wherein the N-terminal module
of these modules is recognising L-aspartic acid
and the C-terminal module of these modules is
recognising L-phenylalanine and is covalently
bound at its N-terminal end to the condensation
domain, and wherein each of these minimal modules
is composed of an adenylation domain and a
4'-phosphopantetheinyl cofactor containing thiolation
domain, and that the α-L-aspartyl-L-phenylalanine
(Asp-Phe) formed is recovered.

2. Method for the production of Asp-Phe according to
claim 1, characterised in that the condensation
domain in the dipeptide synthetase is connected to
both minimal modules in such way that it is also
covalently bound to the module recognising L-
aspartic acid.

3. Method for the production of Asp-Phe according to
claim 1 or 2, characterised in that also a
releasing factor is present for the Asp-Phe formed
on the dipeptide synthetase.

4. Method for the production of Asp-Phe according to
any of claims 1 to 3, characterised in that the
releasing factor is a protein which shows
thioesterase-like functions and forms an
integrated domain of the dipeptide synthetase at
the C-terminus thereof.

5. Method for the production of Asp-Phe according to
any of claims 1 to 4, characterised in that prior
to the production of Asp-Phe the dipeptide
synthetase has undergone optimisation for its
function by using one or more post-translational
modifying activities.

6. Method for the production of Asp-Phe according to
claim 5, characterised in that as a post-
translational modifying activity a 4'-
phosphopantetheinyl (4'-PP) transferase is used.

7. Method for the production of Asp-Phe according to
any of claims 1 to 6, characterised in that also a
non-integrated protein with thioesterase Type-II-
like activity is present together with the
dipeptide synthetase.

8. Method for the production of Asp-Phe according to
any of claims 1 to 7, characterised in that the
dipeptide synthetase is present in living cell-
material of a micro-organism, and that glucose, L-
Asp and/or L-Phe are being fed to said fermentor,
and that the Asp-Phe formed is recovered.






9. Method for the production of Asp-Phe according to
claim 8, characterised in that the micro-organism
is first grown in a fermentor to reach a
predetermined cell density before the expression
of the Asp-Phe dipeptide synthetase is switched on
and feeding of the glucose, L-Asp and/or L-Phe for
the synthesis of the Asp-Phe dipeptide is started.

10. Method for the production of Asp-Phe according to
claim 9, characterised in that the micro-organism
is an L-phenylalanine producing micro-organism and
that only glucose and L-Asp are being fed.

11. Method for the production of Asp-Phe according to 
claim 10, characterised in that the micro-organism 
is an Escherichia or Bacillus species.

12. Method for the production of Asp-Phe according to
any of claims 8 to 11, characterised in that the
micro-organism used is a strain with reduced
protease activity for Asp-Phe or lacking such
activity towards Asp-Phe.

13. Method for the production of Asp-Phe according to
any of claims 8 to 12, characterised in that the
micro-organism used also contains a suitable
export system for Asp-Phe formed and/or one or
more suitable up-take system(s) for glucose and/or
L-Asp and/or L-Phe.

14. Method for the production of Asp-Phe according to
any of claims 1 to 7, characterised in that the
production of Asp-Phe is carried out in vitro in
an enzyme reactor, while ATP is supplied, and L-
Asp and/or L-Phe are being fed, and the Asp-Phe
formed is recovered.

15. Method for the production of Asp-Phe according to
claim 14, characterised in that the supply of ATP
is provided in part by an in situ ATP-regenerating
system.

16. Method for the production of Asp-Phe according to
claim 15, characterised in that the ATP-
regenerating system is present in a permeabilised
micro-organism.

17. A DNA fragment or a combination of DNA fragments
coding for a non-ribosomal Asp-Phe dipeptide
synthetase, which synthetase comprises two minimal
modules connected by one condensation domain
wherein the N-terminal module of these modules is
recognising L-aspartic acid and the C-terminal
module of these modules is recognising L-
phenylalanine and is covalently bound at its N-
terminal end to the condensation domain, and
wherein each of these minimal modules is composed
of an adenylation domain and a 4'-phospho-
pantetheinyl cofactor containing thiolation
domain.

18. A DNA fragment coding for an Asp-Phe dipeptide
synthetase according to claim 17, characterised in
that the condensation domain in the encoded
dipeptide synthetase is connected to both minimal
modules in such way that it is also covalently
bound to the module recognising L-aspartic acid.

19. A DNA fragment according to claim 17 or 18, or a
combination of DNA fragments according to claim
17, characterised in that the DNA fragment or
combination of DNA fragments encoding the
dipeptide synthetase also code for a releasing
factor for the Asp-Phe formed on that dipeptide
synthetase.

20. A DNA fragment or a combination of DNA fragments
according to any of claims 17 to 19, characterised
in that it/they also code for a protein which
shows thioesterase-like functions and forms an
integrated domain of the dipeptide synthetase at
the C-terminus thereof.

21. A DNA fragment or a combination of DNA fragments
according to any of claims 17 to 20, characterised
in that it/they also express(es) a post-
translational modifying activity for efficient
non-ribosomal synthesis of Asp-Phe on the
synthetase.

22. A DNA fragment or a combination of DNA fragments
according to claim 21, characterised in that the
post-translational modifying activity expressed is
a 4'-phosphopantetheinyl (4'-PP) transferase
activity.

23. A DNA fragment or a combination of DNA fragments
according to any of claims 17 to 22, characterised
in that it/they also code for a non-integrated
protein with thioesterase Type-II-like activity.

24. A micro-organism containing a DNA fragment or a
combination of DNA fragments according to any of
claims 17-23.

25. A micro-organism according to claim 24 wherein the
micro-organism is capable of producing L-Asp
and/or L-Phe.

26. A micro-organism according to claim 25 wherein the
micro-organism is an Escherichia coli or Bacillus
species.

27. Asp-Phe dipeptide synthetase characterised in that
it comprises two minimal modules connected by one
condensation domain wherein the N-terminal module
of these modules is recognising L-aspartic acid
and the C-terminal module of these modules is
recognising L-phenylalanine and is covalently
bound at its N-terminal end to the condensation
domain, and wherein each of these minimal modules
is composed of an adenylation domain and a 4'-
phosphopantetheinyl cofactor containing thiolation
domain.

28. Asp-Phe dipeptide synthetase according to claim 27
characterised in that the condensation domain in
the dipeptide synthetase is connected to both
minimal modules in such way that it is also
covalently bound to the module recognising L-
aspartic acid.

29. Asp-Phe dipeptide synthetase according to claim 27
or 28, characterised in that the dipeptide
synthetase also comprises a releasing factor for
the Asp-Phe formed on that dipeptide synthetase.

30. Asp-Phe dipeptide synthetase according to claim
29, characterised in that the releasing factor is
a protein which shows thioesterase-like functions
and forms an integrated domain of the dipeptide
synthetase at its C-terminus.

The present invention relates to a new,
microbiological, method for the production of α-L-
aspartyl-L-phenylalanine (Asp-Phe) from the substrates
L-aspartic acid (L-Asp) and L-phenylalanine (L-Phe)
wherein the substrates are contacted, in the presence
of ATP, with a non-ribosomal dipeptide synthetase
comprising two minimal modules connected by one
condensation domain wherein the N- resp. C-terminal
modules are recognising L-Asp and L-Phe, respectively,
and the latter module is covalently bound at its N-
terminal end to the condensation domain, and wherein
each of these minimal modules is composed of an
adenylation domain and a 4'-phosphopantetheinyl
cofactor containing thiolation domain, and that the
Asp-Phe formed is recovered. The present invention also
relates to novel DNA fragments or combination of DNA
fragments encoding a new Asp-Phe dipeptide synthetase,
micro-organisms containing such DNA fragments, as well
as to the new Asp-Phe dipeptide synthetase itself.



